Low-Signal-Energy Asymptotics of Capacity
and Mutual Information for the Discrete-Time
Poisson Channel

Alfonso Martinez 
Abstract 

The first terms of the low-signal-energy asymptotics for the mutual information in the discrete-time Poisson 
channel are derived and compared to an asymptotic expression of the capacity. In the presence of non-zero additive 
noise (either Poisson or geometric), the mutual information is concave at zero signal-energy and the minimum energy 
per bit is not attained at zero capacity. Fixed signal constellations which scale with the signal energy do not attain 
the minimum energy per bit. The minimum energy per bit is zero when additive Poisson noise is present and $\varepsilon_n \log 2$
when additive geometric noise of mean $\varepsilon_n$ is present.

I. Motivation and Notation 

In the complex-valued Gaussian channel with signal-to-noise ratio SNR the mutual information of very general
constellations (e.g. zero-mean with uncorrelated real and imaginary parts each of energy $\tfrac{1}{2}$ [1]) has the same
low-SNR asymptotics as the channel capacity, namely $\mathrm{SNR} - \tfrac{1}{2}\mathrm{SNR}^2 + o(\mathrm{SNR}^2)$. These constellations also attain
the minimum bit-energy-to-noise-variance ratio of -1.59 dB at vanishing SNR. A natural question concerns the
extent to which this universality extends to other common channel models. We consider here the discrete-time
Poisson channel, frequently used to represent optical communication channels, and quantify the gap between the
channel capacity and the mutual information for fixed signal constellations. As a by-product of our analysis, we
also determine the asymptotic form of the capacity at vanishing signal energy.

Consider a memoryless channel with input X and output Y given by the sum 

$$Y = S(X) + Z \tag{1}$$

of a noise $Z$ and a signal component $S(X)$, itself a function of the input $X$. The input $X$ is a non-negative real
number (i.e. it has units of energy), drawn from a unit-energy set $\mathcal{X}$ according to a probability distribution $P(x)$.
We let $S(X)$ be distributed according to a Poisson distribution of parameter $\varepsilon_s X$, where $\varepsilon_s$ is an average signal
energy. The output components $S(X)$, $Z$, and $Y$ are nonnegative integers.

We study three channel models: noiseless, with $Z = 0$; additive Poisson noise, where $Z$ follows a Poisson
distribution of mean $\varepsilon_n > 0$; and additive geometric noise, with $Z$ distributed according to a geometric distribution
of mean $\varepsilon_n > 0$. With additive Poisson noise the channel transition probability, denoted by $Q(y|x)$, is given by

$$Q(y|x) = e^{-(\varepsilon_s x + \varepsilon_n)} \frac{(\varepsilon_s x + \varepsilon_n)^y}{y!}, \tag{2}$$

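where $\varepsilon_n > 0$. For the channel with geometric noise, we have

$$Q(y|x) = \sum_{l=0}^{y} \frac{e^{-\varepsilon_s x}}{1+\varepsilon_n} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} \frac{\left(\varepsilon_s x \left(1 + \frac{1}{\varepsilon_n}\right)\right)^{l}}{l!}. \tag{3}$$

Note that the model with additive geometric noise arises in representations of electromagnetic radiation as a
photon gas [2].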
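
The channel models above can be made concrete numerically. The following Python sketch is only an illustration under stated assumptions: the function names are hypothetical and do not come from the original text. It evaluates the Poisson-noise transition probability of Eq. (2), builds the geometric-noise transition probability by direct convolution of the Poisson signal with the geometric noise, and compares it with the closed form obtained by factoring the geometric term out of the convolution sum.

```python
import math

def poisson_pmf(k, mean):
    """Poisson probability of k counts; a zero mean gives a point mass at 0."""
    if mean == 0.0:
        return 1.0 if k == 0 else 0.0
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def q_poisson_noise(y, x, eps_s, eps_n):
    """Eq. (2): Y is Poisson with mean eps_s * x + eps_n."""
    return poisson_pmf(y, eps_s * x + eps_n)

def geometric_pmf(z, eps_n):
    """Geometric noise of mean eps_n on z = 0, 1, 2, ..."""
    return (eps_n / (1.0 + eps_n)) ** z / (1.0 + eps_n)

def q_geometric_noise(y, x, eps_s, eps_n):
    """Direct convolution of the Poisson signal S(X) with the noise Z."""
    return sum(poisson_pmf(l, eps_s * x) * geometric_pmf(y - l, eps_n)
               for l in range(y + 1))

def q_geometric_closed_form(y, x, eps_s, eps_n):
    """Same probability with the geometric factor pulled out of the sum."""
    a = eps_s * x * (1.0 + 1.0 / eps_n)
    term, partial = 1.0, 1.0            # running a^l / l! and its partial sum
    for l in range(1, y + 1):
        term *= a / l
        partial += term
    beta = eps_n / (1.0 + eps_n)
    return math.exp(-eps_s * x) * beta ** y * partial / (1.0 + eps_n)
```

Both forms of the geometric-noise law should agree to rounding error, and each transition law should sum to one over the outputs $y$.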

In this letter, we compute the minimum energy per bit for these models. We also study the asymptotics of the
mutual information $I(X; S(X) + Z)$ at low $\varepsilon_s$ and compare them with the channel capacity $C(\varepsilon_s)$ at energy $\varepsilon_s$.
The main results are presented in the next section; the proofs can be found in the appendices.

II. Main Results and Discussion 

A. Capacity Asymptotics and Capacity per Unit Cost 

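A closed-form expression for the capacity $C(\varepsilon_s)$ of the discrete-time Poisson channel is not known. For the
noiseless channel (i.e. with $Z = 0$), the best known firm upper bound was derived in [3], and is given by

$$C(\varepsilon_s) \le \log\left( \left(1 + \frac{\sqrt{2e}-1}{\sqrt{1+2\varepsilon_s}}\right) \frac{\left(\varepsilon_s + \frac{1}{2}\right)^{\varepsilon_s + \frac{1}{2}}}{\sqrt{e}\,\varepsilon_s^{\varepsilon_s}} \right). \tag{4}$$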



As for lower bounds, binary modulation attains a high mutual information at low values of $\varepsilon_s$. Specifically, let
the symbols be located at $x = 0$ and $x = 1/p$, and be respectively used with probabilities $1-p$ and $p$. We denote
the mutual information attained by such modulation by $I_b(p)$. A straightforward computation gives

$$I_b(p) = -\left(p - p e^{-\varepsilon_s/p}\right)\log p - \varepsilon_s e^{-\varepsilon_s/p} - \left(1 - p + p e^{-\varepsilon_s/p}\right)\log\left(1 - p + p e^{-\varepsilon_s/p}\right). \tag{5}$$

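Setting $p = \varepsilon_s$ in Eq. (5), and using the Taylor expansion of the logarithm $\log(1 + x)$ around $x = 0$ in both Eq. (4) and
Eq. (5), we have that

$$-(1 - e^{-1})\,\varepsilon_s \log \varepsilon_s + O(\varepsilon_s) \le C(\varepsilon_s) \le -\varepsilon_s \log \varepsilon_s + O(\varepsilon_s). \tag{6}$$

Therefore, the capacity $C(\varepsilon_s)$ of the noiseless discrete-time Poisson channel behaves as $O(-\varepsilon_s \log \varepsilon_s)$ at vanishing
$\varepsilon_s$.

Moreover, a similar reasoning shows that flash signalling with $p = -\varepsilon_s \log \varepsilon_s$ (for $\varepsilon_s \ll 1$) asymptotically
behaves as

$$I_b(-\varepsilon_s \log \varepsilon_s) = -\varepsilon_s \log \varepsilon_s + o(-\varepsilon_s \log \varepsilon_s) \tag{7}$$

for low $\varepsilon_s$. Combining Eqs. (6) and (7), we obtain the following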
Theorem 1. For vanishing $\varepsilon_s$ the capacity $C(\varepsilon_s)$ behaves as

$$C(\varepsilon_s) = -\varepsilon_s \log \varepsilon_s + o(-\varepsilon_s \log \varepsilon_s). \tag{8}$$

This result complements the asymptotic behaviour for very large values of $\varepsilon_s$, which was established in [4],

$$C(\varepsilon_s) = \tfrac{1}{2} \log \varepsilon_s + o(\log \varepsilon_s). \tag{9}$$

Observe that Eq. (8) implies that the capacity per unit energy $C_1$,

$$C_1 = \sup_{\varepsilon_s} \frac{C(\varepsilon_s)}{\varepsilon_s}, \tag{10}$$

is infinite for the discrete-time Poisson channel. This well-known result had been obtained by Verdú [5] by exploiting
a simple formula for $C_1$ in channels which have a zero-energy symbol ($x = 0$ in our case), namely

$$C_1 = \sup_{x} \frac{D\left(Q(y|x)\,\|\,Q(y|x=0)\right)}{\varepsilon_s x}, \tag{11}$$

where $D\left(Q(y|x)\,\|\,Q(y|0)\right)$ is the divergence between the transition probabilities $Q(y|x)$ for arbitrary input $x$ and
zero input $x = 0$.

By definition, the minimum energy per bit is $\varepsilon_{b,\min} = \inf_{\varepsilon_s} \frac{\varepsilon_s}{C(\varepsilon_s)}$, where the capacity is measured in bits; with $C_1$
measured in nats, it is given by $\varepsilon_{b,\min} = \frac{\log 2}{C_1}$.

Applied to the family of channels we consider, we have

Theorem 2. In the absence of additive noise, i.e. for $Z = 0$, or in the presence of additive Poisson noise, the
minimum energy per bit is $\varepsilon_{b,\min} = 0$. Equivalently the capacity per unit cost is $C_1 = \infty$.

With additive geometric noise, the minimum energy per bit is $\varepsilon_{b,\min} = \varepsilon_n \log 2$ and the capacity per unit cost is
$C_1 = \varepsilon_n^{-1}$ nats.

The latter result is new. Remarkably, the minimum energy per bit has the same form as in the Gaussian channel,
for a minimum ratio $\varepsilon_b/\varepsilon_n$ of -1.59 dB.

Proof: The proof can be found in Appendix A. ■
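
The divergence ratio inside the supremum of Eq. (11) is easy to evaluate numerically. The sketch below is an illustration with hypothetical function names and an assumed truncation of the sum over $y$. It uses the closed-form divergence between two Poisson laws for Poisson noise, and a numerical sum for geometric noise, with $Q(y|x)$ built through the recursion $Q(y|x) = \beta\,Q(y-1|x) + P_S(y)/(1+\varepsilon_n)$, where $\beta = \varepsilon_n/(1+\varepsilon_n)$ and $P_S$ is the Poisson law of the signal. It only illustrates two facts: the Poisson-noise ratio grows without bound in $x$, and the geometric-noise ratio never exceeds $1/\varepsilon_n$.

```python
import math

def ratio_poisson_noise(x, eps_s, eps_n):
    """D(Q(.|x) || Q(.|0)) / (eps_s * x) for additive Poisson noise."""
    lam1, lam0 = eps_s * x + eps_n, eps_n
    divergence = lam1 * math.log(lam1 / lam0) - lam1 + lam0
    return divergence / (eps_s * x)

def ratio_geometric_noise(x, eps_s, eps_n):
    """Same ratio for geometric noise, with the divergence summed over y."""
    m = eps_s * x                                   # mean of the Poisson signal
    beta = eps_n / (1.0 + eps_n)
    a = m * (1.0 + 1.0 / eps_n)
    y_max = int(a + 60.0 * (1.0 + eps_n)) + 50      # assumed truncation point
    divergence, q_prev = 0.0, 0.0
    for y in range(y_max + 1):
        signal = math.exp(-m + y * math.log(m) - math.lgamma(y + 1))
        q_x = beta * q_prev + signal / (1.0 + eps_n)   # Q(y|x) by recursion
        q_0 = beta ** y / (1.0 + eps_n)                # Q(y|0): noise only
        divergence += q_x * math.log(q_x / q_0)
        q_prev = q_x
    return divergence / m
```

Evaluating `ratio_poisson_noise` at increasing `x` shows the logarithmic growth behind $C_1 = \infty$, while `ratio_geometric_noise` stays between 0 and $1/\varepsilon_n$ for every `x` tried.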

B. Mutual Information Asymptotics 

We now move on to study the asymptotics of the mutual information $I(X; S(X) + Z)$ (in nats) at low $\varepsilon_s$ for
fixed unit-energy constellations $\mathcal{X}$. The output $S(X)$ is distributed according to a Poisson distribution of parameter
$\varepsilon_s X$. We determine the first two coefficients $c_1$ and $c_2$ in the Taylor expansion around $\varepsilon_s = 0$, that is

$$I(X; S(X) + Z) = c_1 \varepsilon_s + c_2 \varepsilon_s^2 + o(\varepsilon_s^2). \tag{12}$$

Also, we define the energy per bit $\varepsilon_b$ as $\varepsilon_b = \frac{\varepsilon_s}{I(X;Y)} \log 2$.

Denoting the first- and second-order moments of the constellation by $\mu_1$ (we often have $\mu_1 = 1$) and $\mu_2$
respectively, we have

Theorem 3. In the absence of additive noise, i.e. for $Z = 0$, the mutual information behaves at low $\varepsilon_s$ as Eq. (12)
with

$$c_1 = \sum_{x} P(x)\, x \log\frac{x}{\mu_1}, \qquad c_2 = \frac{1}{2}\left(\mu_2 - \mu_1^2 - \mu_2 \log\frac{\mu_2}{\mu_1^2}\right). \tag{13}$$

As long as $\varepsilon_n > 0$, regardless of whether $Z$ has a Poisson or a geometric distribution, the coefficients in Eq. (12)
are

$$c_1 = 0, \qquad c_2 = \frac{\mu_2 - \mu_1^2}{2\varepsilon_n}. \tag{14}$$

Proof: Eq. (13) is proved in Appendix B, and Eq. (14) in Appendices C and D for Poisson and geometric noise
respectively. ■
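
The two-term expansion of Theorem 3 can be checked numerically on a small constellation. The sketch below is an illustration only: the function names, the two-point constellation and the truncation `y_max` are assumptions, and the geometric-noise case is not covered. It computes $I(X;Y)$ by direct summation for the noiseless and Poisson-noise channels and compares it with $c_1\varepsilon_s + c_2\varepsilon_s^2$.

```python
import math

def poisson_pmf(k, mean):
    """Poisson probability of k counts; a zero mean gives a point mass at 0."""
    if mean == 0.0:
        return 1.0 if k == 0 else 0.0
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def mutual_information(points, probs, eps_s, eps_n=0.0, y_max=60):
    """I(X;Y) in nats for Y ~ Poisson(eps_s * x + eps_n)."""
    total = 0.0
    for y in range(y_max + 1):
        q = [poisson_pmf(y, eps_s * x + eps_n) for x in points]
        py = sum(p * qi for p, qi in zip(probs, q))
        total += sum(p * qi * math.log(qi / py)
                     for p, qi in zip(probs, q) if qi > 0.0)
    return total

def expansion_coefficients(points, probs, eps_n=0.0):
    """(c1, c2): noiseless form if eps_n == 0, noisy form otherwise."""
    mu1 = sum(p * x for p, x in zip(probs, points))
    mu2 = sum(p * x * x for p, x in zip(probs, points))
    if eps_n == 0.0:
        c1 = sum(p * x * math.log(x / mu1)
                 for p, x in zip(probs, points) if x > 0.0)
        c2 = 0.5 * (mu2 - mu1 ** 2 - mu2 * math.log(mu2 / mu1 ** 2))
    else:
        c1, c2 = 0.0, (mu2 - mu1 ** 2) / (2.0 * eps_n)
    return c1, c2
```

With the assumed constellation `points = [0.5, 1.5]`, `probs = [0.5, 0.5]` and `eps_s = 1e-4`, the directly computed mutual information and the two-term expansion differ by much less than the second-order term itself, in both the noiseless case and the Poisson-noise case with `eps_n = 1.0`.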

C. Discussion 

In the presence of non-zero additive noise, the mutual information is concave at zero signal-energy (because
$c_1 = 0$, $c_2 > 0$) and the minimum energy per bit is not attained at zero capacity. This effect can be seen in Fig. 1,
which depicts the energy per bit as a function of the mutual information for several positive values of $\varepsilon_n$. The
modulation depicted is uniform pulse-energy modulation (PEM), e.g. $2^m$ points uniformly located between 0 and
2 with spacing $1/2^m$. General signal constellations do not attain the minimum energy-per-bit at vanishing signal
energy. The determination of the minimum energy per bit attained by these modulations is an open problem.

[Fig. 1 about here.] 

Moreover, since there is no term in $-\varepsilon_s \log \varepsilon_s$ in Eq. (12), these modulations do not attain the minimum
energy per bit $\varepsilon_{b,\min}$. This is true even for the noiseless channel, for which $c_1 \neq 0$. In this case, binary modulation
at points $x = 0$ and $x = 1/p$, respectively used with probabilities $1 - p$ and $p$, with fixed $p$, has coefficients $c_1$ and
$c_2$

$$c_1 = -\log p, \qquad c_2 = \frac{1 - p + \log p}{2p}. \tag{15}$$

In the limit $p \to 0$, $c_1 \to \infty$ and the bit energy at zero capacity, $\varepsilon_{b,0}$, approaches 0, the minimum energy per bit.
Fig. 2 depicts $\varepsilon_b$ as a function of the mutual information for various fixed values of $p$ and for $p = -\varepsilon_s \log \varepsilon_s$. For
comparison, we also include the value of $\varepsilon_b$ corresponding to the upper bound in Eq. (4). Even though $\varepsilon_{b,0}$ indeed
approaches zero, it does so rather slowly. Also, the gap between the energy per bit corresponding to $I_b(-\varepsilon_s \log \varepsilon_s)$
and $\varepsilon_{b,\min}$ closes slowly. Numerical evaluation shows that it is only for values of $\varepsilon_s$ below $10^{-307}$ (!) that $I_b$
exceeds $0.99 \cdot \varepsilon_s \log \varepsilon_s^{-1}$. Even though one eventually has $C(\varepsilon_s) \simeq -\varepsilon_s \log \varepsilon_s$, convergence to the limit is very
slow. This fact, together with the concave nature of the mutual information $I(X;Y)$ at zero $\varepsilon_s$ for nonzero additive
noise, suggests that the asymptotic analysis of the capacity and the mutual information in the discrete-time Poisson
channel fails to capture the key features of these quantities. This behaviour stands in contrast with the Gaussian
channel, where asymptotic expansions give an accurate representation of the capacity and the mutual information
[1].
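
The slow convergence of flash signalling can be reproduced with a few lines of Python. The sketch below is illustrative; the function names are hypothetical, and `expm1`/`log1p` are used only as a numerical precaution so that Eq. (5) stays accurate in double precision for extremely small $\varepsilon_s$. It evaluates the ratio $I_b(p)/(-\varepsilon_s\log\varepsilon_s)$ at $p = -\varepsilon_s\log\varepsilon_s$.

```python
import math

def ib_flash(eps_s):
    """Eq. (5) evaluated at p = -eps_s * log(eps_s), in nats."""
    p = -eps_s * math.log(eps_s)
    d = -math.expm1(-eps_s / p)           # d = 1 - exp(-eps_s / p)
    return (-p * d * math.log(p)          # -(p - p e^{-eps_s/p}) log p
            - eps_s * (1.0 - d)           # -eps_s e^{-eps_s/p}
            - (1.0 - p * d) * math.log1p(-p * d))

def ratio_to_limit(eps_s):
    """I_b / (-eps_s log eps_s); tends to 1 as eps_s -> 0."""
    return ib_flash(eps_s) / (-eps_s * math.log(eps_s))
```

Calling `ratio_to_limit` for `eps_s` equal to `1e-5`, `1e-50` and `1e-300` gives an increasing sequence that is still below 0.99 at `1e-300`, in line with the very slow convergence described above.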



August 20, 2008 



DRAFT 






[Fig. 2 about here.] 

Appendix A 
Capacity per Unit Energy 

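We first consider the case with additive Poisson noise. Using Eq. (2) for $Q(\cdot|\cdot)$ and the definition of divergence,
we have

$$D\left(Q(y|x)\,\|\,Q(y|0)\right) = -\varepsilon_s x + (\varepsilon_s x + \varepsilon_n) \log\frac{\varepsilon_s x + \varepsilon_n}{\varepsilon_n}. \tag{16}$$

Hence,

$$C_1 = \sup_{x} \left(-1 + \left(1 + \frac{\varepsilon_n}{\varepsilon_s x}\right) \log\frac{\varepsilon_s x + \varepsilon_n}{\varepsilon_n}\right) = \infty. \tag{17}$$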

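We now consider the channel with additive geometric noise. Using Eq. (3) for $Q(\cdot|\cdot)$ and the definition of
divergence, we have

$$D\left(Q(y|x)\,\|\,Q(y|0)\right) = \sum_{y} Q(y|x) \log\left(e^{\varepsilon_s x/\varepsilon_n} \sum_{l=0}^{y} e^{-a} \frac{a^l}{l!}\right), \tag{18}$$

where $a = \varepsilon_s x \left(1 + \frac{1}{\varepsilon_n}\right)$. Let us define $P(l) = e^{-a} \frac{a^l}{l!}$ and the quantity $q(y) = \sum_{l=0}^{y} P(l)$, i.e. the cumulative
distribution function of a Poisson random variable with mean $a$.

Moving the exponential out of the logarithm, we obtain

$$D\left(Q(y|x)\,\|\,Q(y|0)\right) = \frac{\varepsilon_s x}{\varepsilon_n} + \sum_{y} Q(y|x) \log(q(y)). \tag{19}$$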

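Hence, the capacity per unit energy is given by

$$C_1 = \frac{1}{\varepsilon_n} + \sup_{x} \frac{\sum_{y} Q(y|x) \log(q(y))}{\varepsilon_s x}. \tag{20}$$

Since $q(y) \le 1$, its logarithm is always non-positive, and

$$C_1 \le \frac{1}{\varepsilon_n}. \tag{21}$$

The proof is completed by proving that

$$\lim_{x \to \infty} \frac{e^{\varepsilon_s x/\varepsilon_n}}{\varepsilon_s x (1 + \varepsilon_n)} \sum_{y=0}^{\infty} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)) = 0, \tag{22}$$

where we expressed $Q(y|x)$ as a function of $q(y)$. If this condition holds true, then Eq. (21) becomes an equality.

In Eq. (22) we split the summation over $y$ into two parts, from 0 to $y^* = \lfloor a \rfloor$, and from $y^* + 1$ to infinity. In
the first part, $P(l)$ is an increasing function in $l$, and therefore

$$q(y) = \sum_{l=0}^{y} P(l) \ge \sum_{l=0}^{y} P(0) = (y+1) P(0) = (y+1) e^{-a}. \tag{23}$$

Hence, the summation for $y \le y^*$ is bounded as

$$\sum_{y=0}^{y^*} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)) \ge \sum_{y=0}^{y^*} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1) e^{-a} \left(\log(y+1) - a\right). \tag{24}$$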
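And, multiplying by the exponential factor $e^{\varepsilon_s x/\varepsilon_n}$, we have

$$\sum_{y=0}^{y^*} e^{-\varepsilon_s x} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1) \log(y+1) - \sum_{y=0}^{y^*} e^{-\varepsilon_s x} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1)\, a. \tag{25}$$

Both summands vanish as $x \to \infty$. The second has the form

$$-e^{-\varepsilon_s x}\, a \sum_{y=0}^{y^*} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1), \tag{26}$$

which decays exponentially in $x$, since the sum satisfies

$$\sum_{y=0}^{y^*} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1) \le \sum_{y=0}^{\infty} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1) = (1+\varepsilon_n)^2, \tag{27}$$

and $e^{-\varepsilon_s x} a (1 + \varepsilon_n)^2$ vanishes for large $x$. Similarly, the summation

$$\sum_{y=0}^{y^*} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} (y+1) \log(y+1) \tag{28}$$

remains bounded, since it is the partial sum of a convergent series, with $n$-th coefficient $\beta^n (n+1) \log(n+1)$ and
$\beta = \varepsilon_n/(1 + \varepsilon_n) < 1$. This is verified by checking the ratio test, as

$$\lim_{n \to \infty} \frac{\beta^{n} (n+1) \log(n+1)}{\beta^{n-1}\, n \log n} = \beta < 1. \tag{29}$$

Boundedness of the partial sum implies that, after multiplying by an exponential factor $e^{-\varepsilon_s x}$, the first summand
vanishes as $x \to \infty$.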

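Next, we consider the remainder of the summation in Eq. (22),

$$\sum_{y=y^*+1}^{\infty} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)). \tag{30}$$

Clearly, $q(0) = e^{-a} \le q(y) \le 1$ and therefore $-a \le \log q(y) \le 0$, so each summand is negative and bounded by

$$\left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)) \ge -a \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \ge -a \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y}. \tag{31}$$

Summing over $y$,

$$\sum_{y=y^*+1}^{\infty} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)) \ge -a (1 + \varepsilon_n) \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y^*+1}. \tag{32}$$

Using that $a = \varepsilon_s x \left(1 + \frac{1}{\varepsilon_n}\right)$ and taking into account the factor $e^{\varepsilon_s x/\varepsilon_n}$ and the denominator $\varepsilon_s x (1 + \varepsilon_n)$ in Eq. (22), we must study
the behaviour of

$$e^{\varepsilon_s x/\varepsilon_n}\, \frac{1+\varepsilon_n}{\varepsilon_n} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y^*+1} = \frac{1+\varepsilon_n}{\varepsilon_n}\, \exp\left(\frac{\varepsilon_s x}{\varepsilon_n} - (y^*+1) \log\left(1 + \frac{1}{\varepsilon_n}\right)\right) \tag{33}$$

as $x \to \infty$. By construction, $y^* + 1 \ge a$, and therefore

$$\frac{\varepsilon_s x}{\varepsilon_n} - (y^*+1) \log\left(1 + \frac{1}{\varepsilon_n}\right) \le \frac{\varepsilon_s x}{\varepsilon_n} - \varepsilon_s x \left(1 + \frac{1}{\varepsilon_n}\right) \log\left(1 + \frac{1}{\varepsilon_n}\right) \tag{34}$$

$$= \varepsilon_s x \left(\frac{1}{\varepsilon_n} - \left(1 + \frac{1}{\varepsilon_n}\right) \log\left(1 + \frac{1}{\varepsilon_n}\right)\right). \tag{35}$$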
Since $t < (1 + t) \log(1 + t)$ for $t > 0$, a fact which follows from the inequality $\log(1 + u) < u$ applied to $u = -t/(1+t)$, the left-hand
side of Eq. (34) is strictly upper bounded by a function $\alpha x$, with $\alpha < 0$. Hence, the function in Eq. (33) vanishes
exponentially as $x \to \infty$, and so does the term

$$\frac{e^{\varepsilon_s x/\varepsilon_n}}{\varepsilon_s x (1 + \varepsilon_n)} \sum_{y=y^*+1}^{\infty} \left(\frac{\varepsilon_n}{1+\varepsilon_n}\right)^{y} q(y) \log(q(y)). \tag{36}$$

This proves the limit in Eq. (22) and that Eq. (21) holds with equality.



Appendix B 

Asymptotics at Low $\varepsilon_s$ for $Z = 0$



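The mutual information is given by

$$I(X;Y) = -\sum_{x} P(x) \sum_{y=0}^{\infty} Q(y|x) \log\left(\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \left(\frac{x'}{x}\right)^{y}\right). \tag{37}$$

Using the Taylor expansion of the exponential $e^t = 1 + t + \frac{1}{2} t^2 + O(t^3)$, we notice that there are only three
possible channel outputs to order $\varepsilon_s^3$, namely

$$y = 0, \quad Q(y|x) = 1 - \varepsilon_s x + \tfrac{1}{2} \varepsilon_s^2 x^2 + O(\varepsilon_s^3) \tag{38}$$

$$y = 1, \quad Q(y|x) = \varepsilon_s x - \varepsilon_s^2 x^2 + O(\varepsilon_s^3) \tag{39}$$

$$y = 2, \quad Q(y|x) = \tfrac{1}{2} \varepsilon_s^2 x^2 + O(\varepsilon_s^3) \tag{40}$$

$$y > 2, \quad Q(y|x) = O(\varepsilon_s^3). \tag{41}$$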
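
The second-order approximations of the output probabilities can be compared with the exact Poisson law. The sketch below is a small illustrative check with hypothetical function names; `m` stands for the product $\varepsilon_s x$.

```python
import math

def exact_poisson(y, m):
    """Exact probability of y counts for a Poisson law of mean m."""
    return math.exp(-m) * m ** y / math.factorial(y)

def second_order(y, m):
    """Truncated expansions for y = 0, 1, 2; zero for larger outputs."""
    table = {0: 1.0 - m + 0.5 * m * m, 1: m - m * m, 2: 0.5 * m * m}
    return table.get(y, 0.0)

def worst_error(m, y_max=5):
    """Largest gap between exact and truncated probabilities over y <= y_max."""
    return max(abs(exact_poisson(y, m) - second_order(y, m))
               for y in range(y_max + 1))
```

For small `m` the largest gap scales as `m ** 3`, which is the order of the terms neglected in the expansion.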

Since each of these cases behaves differently, we examine them separately. 

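We rewrite the variable in the $\log(\cdot)$ in Eq. (37) with the appropriate approximation. When the output is $y = 0$,
the variable is

$$\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} = \sum_{x' \in \mathcal{X}} P(x') \left(1 + \varepsilon_s (x - x') + \tfrac{1}{2} \varepsilon_s^2 (x - x')^2 + O(\varepsilon_s^3)\right) \tag{42}$$

$$= 1 + \varepsilon_s (x - \mu_1) + \tfrac{1}{2} \varepsilon_s^2 (x^2 + \mu_2 - 2 x \mu_1) + O(\varepsilon_s^3). \tag{43}$$

Taking logarithms, and using the formula $\log(1 + t) = t - \frac{1}{2} t^2 + O(t^3)$, we obtain

$$\varepsilon_s (x - \mu_1) + \tfrac{1}{2} \varepsilon_s^2 (x^2 + \mu_2 - 2 x \mu_1) - \tfrac{1}{2} \varepsilon_s^2 (x^2 + \mu_1^2 - 2 x \mu_1) + O(\varepsilon_s^3) \tag{44}$$

$$= \varepsilon_s (x - \mu_1) + \tfrac{1}{2} \varepsilon_s^2 (\mu_2 - \mu_1^2) + O(\varepsilon_s^3). \tag{45}$$

For $y = 1$, the variable in the logarithm in Eq. (37) is

$$\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \frac{x'}{x} = \frac{1}{x} \sum_{x' \in \mathcal{X}} P(x')\, x' \left(1 + \varepsilon_s (x - x') + O(\varepsilon_s^2)\right) \tag{46}$$

$$= \frac{\mu_1}{x} \left(1 + \varepsilon_s \frac{x \mu_1 - \mu_2}{\mu_1} + O(\varepsilon_s^2)\right). \tag{47}$$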
Taking logarithms, and using the Taylor expansion, we get

$$\log\frac{\mu_1}{x} + \frac{\varepsilon_s}{\mu_1} (x \mu_1 - \mu_2) + O(\varepsilon_s^2). \tag{48}$$

We will later verify that no higher-order terms are required.

Finally, for $y = 2$, the variable in the logarithm in Eq. (37) is

$$\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \frac{x'^2}{x^2} = \frac{1}{x^2} \sum_{x' \in \mathcal{X}} P(x')\, x'^2 \left(1 + O(\varepsilon_s)\right) \tag{49}$$

$$= \frac{\mu_2}{x^2} \left(1 + O(\varepsilon_s)\right). \tag{50}$$

Again, taking logarithms, and using the Taylor expansion, we get

$$\log\frac{\mu_2}{x^2} + O(\varepsilon_s). \tag{51}$$

Later, we will verify that no higher-order terms are required.

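To carry out the averaging over $y$, we first combine Eqs. (45), (48) and (51) with the probabilities in
Eqs. (38)-(40) and then group all terms up to $O(\varepsilon_s^3)$ to derive

$$\left(1 - \varepsilon_s x + \tfrac{1}{2} \varepsilon_s^2 x^2\right) \left(\varepsilon_s (x - \mu_1) + \tfrac{1}{2} \varepsilon_s^2 (\mu_2 - \mu_1^2)\right) + \left(\varepsilon_s x - \varepsilon_s^2 x^2\right) \left(\log\frac{\mu_1}{x} + \frac{\varepsilon_s}{\mu_1} (x \mu_1 - \mu_2)\right) + \tfrac{1}{2} \varepsilon_s^2 x^2 \log\frac{\mu_2}{x^2} + O(\varepsilon_s^3) \tag{52}$$

$$= \varepsilon_s (x - \mu_1) + \tfrac{1}{2} \varepsilon_s^2 (\mu_2 - \mu_1^2) - \varepsilon_s^2 (x^2 - x \mu_1) + \varepsilon_s x \log\frac{\mu_1}{x} - \varepsilon_s^2 x^2 \log\frac{\mu_1}{x} + \frac{\varepsilon_s^2}{\mu_1} (x^2 \mu_1 - x \mu_2) + \tfrac{1}{2} \varepsilon_s^2 x^2 \log\frac{\mu_2}{x^2} + O(\varepsilon_s^3). \tag{53}$$

The expectation over $x$ is straightforward and, together with the minus sign in Eq. (37), gives the desired $I(X;Y)$.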

Appendix C 

Asymptotics at Low $\varepsilon_s$ for Poisson Noise

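The mutual information is given by

$$I(X;Y) = -\sum_{x} P(x) \sum_{y=0}^{\infty} Q(y|x) \log\left(\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \frac{\left(1 + \frac{\varepsilon_s x'}{\varepsilon_n}\right)^{y}}{\left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right)^{y}}\right). \tag{54}$$

Using the Taylor expansion of the exponential, and neglecting terms of order higher than $\varepsilon_s^2$, the channel output
law is given by

$$Q(y|x) = \frac{\varepsilon_n^{y} e^{-\varepsilon_n}}{y!} \left(1 + \varepsilon_s x \left(\frac{y}{\varepsilon_n} - 1\right) + \frac{\varepsilon_s^2 x^2}{2} \left(\frac{y(y-1)}{\varepsilon_n^2} + 1 - \frac{2y}{\varepsilon_n}\right)\right) + O(\varepsilon_s^3). \tag{55}$$

We next examine the logarithm in Eq. (54). First, the Taylor expansions of $(1 + t)^{y}$ and $(1 + t)^{-y}$ yield

$$\frac{\left(1 + \frac{\varepsilon_s x'}{\varepsilon_n}\right)^{y}}{\left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right)^{y}} = 1 + \frac{\varepsilon_s}{\varepsilon_n}\, y (x' - x) + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} \left(y^2 (x' - x)^2 - y (x'^2 - x^2)\right) + O(\varepsilon_s^3). \tag{56}$$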
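Similarly, using the expansion of the exponential, we have

$$e^{\varepsilon_s (x - x')} \frac{\left(1 + \frac{\varepsilon_s x'}{\varepsilon_n}\right)^{y}}{\left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right)^{y}} = 1 + \varepsilon_s \left(\frac{y}{\varepsilon_n} - 1\right) (x' - x) + \frac{\varepsilon_s^2}{2} \left(\left(1 - \frac{y}{\varepsilon_n}\right)^2 (x - x')^2 + y\, \frac{x^2 - x'^2}{\varepsilon_n^2}\right) + O(\varepsilon_s^3). \tag{57}$$

Now, carrying out the expectation over $x'$ we obtain

$$\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \frac{\left(1 + \frac{\varepsilon_s x'}{\varepsilon_n}\right)^{y}}{\left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right)^{y}} = 1 + \varepsilon_s \left(\frac{y}{\varepsilon_n} - 1\right) (\mu_1 - x) + \frac{\varepsilon_s^2}{2} \left(\left(1 - \frac{y}{\varepsilon_n}\right)^2 (x^2 + \mu_2 - 2 x \mu_1) + y\, \frac{x^2 - \mu_2}{\varepsilon_n^2}\right) + O(\varepsilon_s^3). \tag{58}$$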



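Next, using the expansion of the logarithm, we obtain

$$\log\left(\sum_{x' \in \mathcal{X}} P(x')\, e^{\varepsilon_s (x - x')} \frac{\left(1 + \frac{\varepsilon_s x'}{\varepsilon_n}\right)^{y}}{\left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right)^{y}}\right) = \varepsilon_s \left(\frac{y}{\varepsilon_n} - 1\right) (\mu_1 - x) + \frac{\varepsilon_s^2}{2} \left(\left(1 - \frac{y}{\varepsilon_n}\right)^2 (\mu_2 - \mu_1^2) + y\, \frac{x^2 - \mu_2}{\varepsilon_n^2}\right) + O(\varepsilon_s^3). \tag{59}$$

Now, multiplying by the channel law, we get for given $x$ and $y$

$$\frac{\varepsilon_n^{y} e^{-\varepsilon_n}}{y!} \left(\varepsilon_s \left(\frac{y}{\varepsilon_n} - 1\right) (\mu_1 - x) + \frac{\varepsilon_s^2}{2} \left(\left(1 - \frac{y}{\varepsilon_n}\right)^2 (\mu_2 - \mu_1^2 + 2 \mu_1 x - 2 x^2) + y\, \frac{x^2 - \mu_2}{\varepsilon_n^2}\right) + O(\varepsilon_s^3)\right). \tag{60}$$

After carrying out the expectation over $x$, some terms cancel to give

$$\frac{\varepsilon_n^{y} e^{-\varepsilon_n}}{y!} \left(\frac{\varepsilon_s^2}{2} \left(1 - \frac{y}{\varepsilon_n}\right)^2 (\mu_1^2 - \mu_2) + O(\varepsilon_s^3)\right). \tag{61}$$

As a final step, we sum over $y$ and account for the minus sign in Eq. (54) to obtain the mutual information,

$$\sum_{y=0}^{\infty} \frac{\varepsilon_n^{y} e^{-\varepsilon_n}}{y!}\, \frac{\varepsilon_s^2}{2} \left(1 - \frac{2y}{\varepsilon_n} + \frac{y^2}{\varepsilon_n^2}\right) (\mu_2 - \mu_1^2) + O(\varepsilon_s^3) = \frac{\varepsilon_s^2}{2 \varepsilon_n} (\mu_2 - \mu_1^2) + O(\varepsilon_s^3). \tag{62}$$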
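
The last step relies on the average of $(1 - y/\varepsilon_n)^2$ over a Poisson law of mean $\varepsilon_n$ being equal to $1/\varepsilon_n$, since the variance of that law is $\varepsilon_n$. The sketch below is an illustrative numerical check of this identity; the function name and the truncation point `y_max` are assumptions.

```python
import math

def poisson_weighted_square(eps_n, y_max=400):
    """Sum over y of Poisson(y; eps_n) * (1 - y / eps_n)^2, truncated at y_max."""
    total = 0.0
    for y in range(y_max + 1):
        pmf = math.exp(-eps_n + y * math.log(eps_n) - math.lgamma(y + 1))
        total += pmf * (1.0 - y / eps_n) ** 2
    return total
```

For any positive noise mean, for example `eps_n = 0.2`, `1.0` or `7.5`, the result should match `1 / eps_n` to rounding error, which is the factor appearing in the second-order coefficient.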

Appendix D 

Asymptotics at Low $\varepsilon_s$ for Geometric Noise

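The mutual information is given by

$$I(X;Y) = -\sum_{x} P(x) \sum_{y=0}^{\infty} Q(y|x) \log\left(\frac{\sum_{x' \in \mathcal{X}} Q(y|x') P(x')}{Q(y|x)}\right), \tag{63}$$

where $Q(y|x)$ is given by Eq. (3).

As in the noiseless discrete-time Poisson channel, the Taylor expansion of the exponential implies that there
are only three possible signal outputs $s$ to order $\varepsilon_s^3$, that is,

$$s = 0, \quad P(s|x) = 1 - \varepsilon_s x + \tfrac{1}{2} (\varepsilon_s x)^2 + O(\varepsilon_s^3) \tag{64}$$

$$s = 1, \quad P(s|x) = \varepsilon_s x - (\varepsilon_s x)^2 + O(\varepsilon_s^3) \tag{65}$$

$$s = 2, \quad P(s|x) = \tfrac{1}{2} (\varepsilon_s x)^2 + O(\varepsilon_s^3) \tag{66}$$

$$s > 2, \quad P(s|x) = O(\varepsilon_s^3). \tag{67}$$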
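Hence the channel output $y = s + z$ only includes these contributions. We distinguish three cases, viz. $y = 0$,
$y = 1$, and $y \ge 2$.

In the first case, $y = s = z = 0$, and $Q(y|x)$ becomes

$$Q(y|x) = \frac{1}{1 + \varepsilon_n} \left(1 - \varepsilon_s x + \tfrac{1}{2} (\varepsilon_s x)^2 + O(\varepsilon_s^3)\right). \tag{68}$$

For $y = 1$, we combine the outputs $s = 0$ and $s = 1$,

$$Q(y|x) = \frac{\varepsilon_n}{(1 + \varepsilon_n)^2} \left(1 + \frac{\varepsilon_s x}{\varepsilon_n} - \varepsilon_s^2 x^2 \left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) + O(\varepsilon_s^3)\right). \tag{69}$$

For $y \ge 2$, we combine the outputs $s = 0$, $s = 1$, and $s = 2$,

$$Q(y|x) = \frac{\varepsilon_n^{y}}{(1 + \varepsilon_n)^{y+1}} \left(1 + \frac{\varepsilon_s x}{\varepsilon_n} + \frac{\varepsilon_s^2 x^2}{2 \varepsilon_n^2} + O(\varepsilon_s^3)\right), \tag{70}$$

after combining some terms together.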

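We next rewrite the numerator and denominator in the $\log(\cdot)$ in Eq. (63) with the appropriate approximation. For
$y = 0$, the common term $(1 + \varepsilon_n)^{-1}$ cancels, and the numerator is

$$\sum_{x' \in \mathcal{X}} Q(y|x') P(x') = \left(1 - \varepsilon_s \mu_1 + \tfrac{1}{2} \varepsilon_s^2 \mu_2\right) + O(\varepsilon_s^3). \tag{71}$$

In the denominator, we keep the expansion

$$1 - \varepsilon_s x + \tfrac{1}{2} (\varepsilon_s x)^2 + O(\varepsilon_s^3). \tag{72}$$

Taking logarithms of Eqs. (71) and (72), using a Taylor expansion, and combining numerator and denominator, we
obtain

$$\log\left(1 - \varepsilon_s \mu_1 + \tfrac{1}{2} \varepsilon_s^2 \mu_2 + O(\varepsilon_s^3)\right) - \log\left(1 - \varepsilon_s x + \tfrac{1}{2} (\varepsilon_s x)^2 + O(\varepsilon_s^3)\right) \tag{73}$$

$$= -\varepsilon_s (\mu_1 - x) + \tfrac{1}{2} \varepsilon_s^2 (\mu_2 - \mu_1^2) + O(\varepsilon_s^3). \tag{74}$$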
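For $y = 1$, we use Eq. (69). Summing over $x'$ in the numerator, we get

$$\sum_{x' \in \mathcal{X}} Q(y|x') P(x') = 1 + \frac{\varepsilon_s \mu_1}{\varepsilon_n} - \varepsilon_s^2 \mu_2 \left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) + O(\varepsilon_s^3), \tag{75}$$

with the understanding that a common term $\varepsilon_n/(1 + \varepsilon_n)^2$ has been cancelled.

Combining numerator and denominator, taking logarithms, and using the Taylor expansion of the logarithm, we
obtain

$$\log\left(1 + \frac{\varepsilon_s \mu_1}{\varepsilon_n} - \varepsilon_s^2 \mu_2 \left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) + O(\varepsilon_s^3)\right) - \log\left(1 + \frac{\varepsilon_s x}{\varepsilon_n} - \varepsilon_s^2 x^2 \left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) + O(\varepsilon_s^3)\right) \tag{76}$$

$$= \frac{\varepsilon_s}{\varepsilon_n} (\mu_1 - x) - \varepsilon_s^2 \left(\left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) \mu_2 + \frac{\mu_1^2}{2 \varepsilon_n^2} - \frac{(1 + \varepsilon_n)^2}{2 \varepsilon_n^2}\, x^2\right) + O(\varepsilon_s^3). \tag{77}$$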



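If the output is $y \ge 2$, in an analogous way we use Eq. (70) to rewrite the logarithm of the ratio of numerator
and denominator as

$$\log\left(1 + \frac{\varepsilon_s}{\varepsilon_n} \mu_1 + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} \mu_2 + O(\varepsilon_s^3)\right) - \log\left(1 + \frac{\varepsilon_s}{\varepsilon_n} x + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} x^2 + O(\varepsilon_s^3)\right). \tag{78}$$

Using now the Taylor expansion of the logarithm, we obtain

$$\frac{\varepsilon_s}{\varepsilon_n} (\mu_1 - x) + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} (\mu_2 - \mu_1^2) + O(\varepsilon_s^3). \tag{79}$$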

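The remaining steps are the averaging over $x$ and $y$. We first carry out the expectation over $x$. From Eq. (74),
the averaging over $x$ yields

$$\frac{1}{1 + \varepsilon_n} \sum_{x} P(x) \left(1 - \varepsilon_s x\right) \left(-\varepsilon_s (\mu_1 - x) + \tfrac{1}{2} \varepsilon_s^2 (\mu_2 - \mu_1^2)\right) + O(\varepsilon_s^3) = \frac{1}{1 + \varepsilon_n}\, \frac{\varepsilon_s^2}{2} (\mu_1^2 - \mu_2) + O(\varepsilon_s^3). \tag{80}$$

Similarly, from Eq. (77) we obtain (except for a constant factor $\frac{\varepsilon_n}{(1 + \varepsilon_n)^2}$)

$$\sum_{x} P(x) \left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right) \left(\frac{\varepsilon_s}{\varepsilon_n} (\mu_1 - x) - \varepsilon_s^2 \left(\left(\frac{1}{2} + \frac{1}{\varepsilon_n}\right) \mu_2 + \frac{\mu_1^2}{2 \varepsilon_n^2} - \frac{(1 + \varepsilon_n)^2}{2 \varepsilon_n^2}\, x^2\right)\right) + O(\varepsilon_s^3) \tag{81}$$

$$= \frac{\varepsilon_s^2}{\varepsilon_n^2} (\mu_1^2 - \mu_2) + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} (\mu_2 - \mu_1^2) + O(\varepsilon_s^3) \tag{82}$$

$$= (\mu_1^2 - \mu_2)\, \frac{\varepsilon_s^2}{2 \varepsilon_n^2} + O(\varepsilon_s^3). \tag{83}$$

And finally, from Eq. (79), for $y \ge 2$, we get

$$\sum_{x} P(x)\, \frac{\varepsilon_n^{y}}{(1 + \varepsilon_n)^{y+1}} \left(1 + \frac{\varepsilon_s x}{\varepsilon_n}\right) \left(\frac{\varepsilon_s}{\varepsilon_n} (\mu_1 - x) + \frac{\varepsilon_s^2}{2 \varepsilon_n^2} (\mu_2 - \mu_1^2)\right) + O(\varepsilon_s^3) \tag{84}$$

$$= \frac{\varepsilon_n^{y}}{(1 + \varepsilon_n)^{y+1}}\, \frac{\varepsilon_s^2}{2 \varepsilon_n^2} (\mu_1^2 - \mu_2) + O(\varepsilon_s^3). \tag{85}$$

The summation over $y \ge 2$ can be carried out and yields

$$\sum_{y=2}^{\infty} \frac{\varepsilon_n^{y}}{(1 + \varepsilon_n)^{y}} = \frac{\varepsilon_n^2}{1 + \varepsilon_n}. \tag{86}$$

Then, combining Eq. (86) with Eq. (85), and summing with Eqs. (80) and (83) (including the factor $\frac{\varepsilon_n}{(1 + \varepsilon_n)^2}$), we
obtain

$$\frac{\varepsilon_s^2}{2} (\mu_1^2 - \mu_2) \left(\frac{1}{1 + \varepsilon_n} + \frac{1}{\varepsilon_n (1 + \varepsilon_n)^2} + \frac{1}{(1 + \varepsilon_n)^2}\right) + O(\varepsilon_s^3) \tag{87}$$

$$= \frac{\varepsilon_s^2}{2 \varepsilon_n} (\mu_1^2 - \mu_2) + O(\varepsilon_s^3). \tag{88}$$

Since the mutual information is the negative of this average, the expansion for $I(X;Y)$ follows.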
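
The final combination step amounts to checking that the per-output weights add up to $1/\varepsilon_n$. The sketch below is an illustrative check under the following reading of the derivation, which is an assumption of this example: output $y = 0$ contributes its noise-only probability $1/(1+\varepsilon_n)$, while each output $y \ge 1$ contributes its noise-only probability $\varepsilon_n^y/(1+\varepsilon_n)^{y+1}$ divided by $\varepsilon_n^2$. The function name and truncation point are hypothetical.

```python
def geometric_weight_sum(eps_n, y_max=2000):
    """Sum of the per-output weights multiplying (eps_s^2 / 2) * (mu1^2 - mu2)."""
    beta = eps_n / (1.0 + eps_n)
    weight_y0 = 1.0 / (1.0 + eps_n)                  # output y = 0
    weight_y1 = beta / (1.0 + eps_n) / eps_n ** 2    # output y = 1
    tail = sum(beta ** y / (1.0 + eps_n) for y in range(2, y_max + 1))
    weight_y2_up = tail / eps_n ** 2                 # outputs y >= 2
    return weight_y0 + weight_y1 + weight_y2_up
```

For several noise means, for example `eps_n = 0.3`, `1.0` or `4.0`, the returned value should agree with `1 / eps_n`, matching the second-order coefficient obtained for Poisson noise.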
Fig. 1. Bit energy $\varepsilon_b$ as a function of the mutual information for uniform $2^m$-PEM and varying $\varepsilon_n$ (geometric noise).
Fig. 2. Bit energy $\varepsilon_b$ as a function of the mutual information or the capacity (noiseless channel, $Z = 0$).